KMID : 0917520000070010017
Journal of Speech Sciences, 2000, Vol. 7, No. 1, pp. 17-29
Speech Quality of a Sinusoidal Model Depending on the Number of Sinusoids
Seo, Jeong Wook
Kim, Ki Hong/Seok, Jong Won/Bae, Keun Sung
Abstract
STC (Sinusoidal Transform Coding) is a vocoding technique that uses a sinusoidal speech model to obtain high-quality speech at a low data rate. It models and synthesizes the speech signal from the fundamental frequency and its harmonic components in the frequency domain. To reduce the data rate, the sinusoidal amplitudes and phases must be represented with as few spectral peaks as possible while maintaining speech quality. As basic research toward a low-rate speech coding algorithm based on the sinusoidal model, this paper investigates how speech quality depends on the number of sinusoids. Speech signals are reconstructed while varying the number of spectral peaks from 5 to 40, and their quality is evaluated using a spectral envelope distortion measure and the MOS (Mean Opinion Score). Two approaches are used to obtain the spectral peaks: a conventional STFT (Short-Time Fourier Transform) and a multiresolution analysis method.
Keywords: speech coding, STC, speech quality, sinusoidal model
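
The idea of limiting the number of sinusoids can be illustrated with a minimal single-frame sketch. It is not the authors' implementation: the periodic Hann window, the naive DFT, the local-maximum peak picking, the function names (`analyze_peaks`, `synthesize`, `rms_error`) and the synthetic three-component test frame are all assumptions made for illustration, and a plain RMS waveform error stands in for the spectral envelope distortion and MOS measures named in the abstract.

```python
import cmath
import math


def analyze_peaks(frame, num_peaks):
    """Return up to num_peaks (bin, amplitude, phase) triples for one frame."""
    n = len(frame)
    # Periodic Hann window (an assumption; the abstract does not name a window).
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]
    gain = sum(window)
    # Naive DFT over the non-negative frequency bins (O(n^2), fine for a short frame).
    spectrum = [
        sum(frame[i] * window[i] * cmath.exp(-2j * math.pi * k * i / n)
            for i in range(n))
        for k in range(n // 2 + 1)
    ]
    mags = [abs(c) for c in spectrum]
    # Candidate peaks are local maxima of the magnitude spectrum.
    candidates = [
        k for k in range(1, n // 2)
        if mags[k] > mags[k - 1] and mags[k] >= mags[k + 1]
    ]
    # Keep only the num_peaks strongest candidates.
    strongest = sorted(candidates, key=lambda k: mags[k], reverse=True)[:num_peaks]
    # Scale the magnitude by 2 / sum(window) to estimate the sinusoid amplitude.
    return [(k, 2 * mags[k] / gain, cmath.phase(spectrum[k]))
            for k in sorted(strongest)]


def synthesize(peaks, n):
    """Rebuild a frame of length n as a sum of cosines, one per retained peak."""
    return [
        sum(a * math.cos(2 * math.pi * k * i / n + ph) for k, a, ph in peaks)
        for i in range(n)
    ]


def rms_error(a, b):
    """Root-mean-square difference between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))


# Synthetic test frame (hypothetical): three bin-centred sinusoids.
N = 128
components = [(5, 1.0, 0.3), (12, 0.5, -1.0), (20, 0.25, 2.0)]
frame = synthesize(components, N)

# Reconstruction error as the number of retained peaks grows.
errors = [rms_error(frame, synthesize(analyze_peaks(frame, k), N))
          for k in (1, 2, 3)]
for k, err in zip((1, 2, 3), errors):
    print(f"{k} peak(s): RMS error = {err:.6f}")
```

On this toy frame the error shrinks as more peaks are kept. Real speech would call for frame-by-frame analysis with overlap-add synthesis and a perceptually motivated quality measure, which this sketch leaves out.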